The area law and real-space renormalization 
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Real-space renormalization-group techniques for quantum systems can be divided into two basic categories 
— those capable of representing correlations following a simple boundary (or area) law, and those which are 
not. I discuss the scaling of the accuracy of gapped systems in the latter case and analyze the resultant spatial 
anisotropy. It is apparent that particular points in the system, that are somehow 'central' in the renormalization, 
have local quantities that are much closer to the exact results in the thermodynamic limit than the system-wide 
average. Numerical results from the tree-tensor network and tensor renormalization-group approaches for the 
2D transverse-field Ising model and 3D classical Ising model, respectively, clearly demonstrate this effect. 

PACS numbers: 



I. INTRODUCTION 

Solving large, quantum mechanical systems is very chal- 
lenging, primarily because the dimension of the Hilbert space 
describing a system with many components grows exponen- 
tially with the number of components. Direct approaches to 
such problems, for instance by exact diagonalization, quickly 
become intractable, even for relatively small 2D and 3D quan- 
tum systems. Quantum Monte Carlo (QMC) is another di- 
rect approach (exact up to statistical error), but the sign prob- 
lem causes difficulties for many systems of interest — such as 
fermionic, frustrated, or dynamical problems. 

Thus, in order to garner meaningful information about 
large quantum systems, clever approximations need to be em- 
ployed. In this vain, many analytic and numeric techniques 
have been developed over the last 80 years. In this paper, I will 
focus specifically on numerical real-space renormalization- 
group (RG) techniques, which can be applied to both quantum 
and classical problems. 

The process of renormalization takes a divide-and-conquer 
approach to the problem, by tackling different parts of the 
system (or Hilbert space) separately or in succession, care- 
fully simplifying or compressing the pieces at each step. In 
real-space RG, the renormalization procedure groups together 
spatial regions of the system, referred to as blocks. At each 
step, two neighboring blocks are combined, and then simpli- 
fied (for instance, by truncating the Hilbert space). As the 
renormalization proceeds, the blocks include larger and larger 
portions of the initial system, until the entire system of inter- 
est is encapsulated. If the RG reaches a fixed-point, we can 
say we have reached the thermodynamic limit. 

One of the most successful real-space RG algorithms is 
the density-matrix renormalization group (DMRG) [1], which 
very accurately describes (quasi-)lD systems. In this ap- 
proach, a single site is added to the block at each step, and 
the optimal Hilbert space (limited to some dimension x) for 
the combined system is determined. Working in reverse, the 
procedure generates a variational wave-function known as a 
matrix-product state (MPS) (see Fig.Q](a)) 0. DMRG can 
then be thought of as an optimization algorithm that sweeps 
over the tensors in the MPS, targeting the state with lowest 
energy. 

A similar approach is the tree-tensor network (TTN) 0], 




(a) □ 



FIG. 1 : (a) Depiction of an MPS (in the unitary gauge) pictured in 
terms of renormalized Hilbert spaces. Each tensor adds a physical 
site (bottom) and passes the Hilbert space (truncated to x) upwards, 
(b) The ID TTN for 8 sites. Each tensor combines and renormalizes 
two neighboring blocks, (c) A single layer of the 2D TTN, where a 
2x2 square is renormalized into a single site, first by combining in 
the x then y directions. 



in which neighboring blocks are successively combined, de- 
picted in Fig.Q](b). In this direct coarse-graining approach, 
the physical volume of each block doubles at each step. This 
ansatz is used less frequently than MPS/DMRG because the 
numerical cost is higher for a given amount of system en- 
tanglement or accuracy (0(x 4 ) vs. 0(x 3 ))- The tensor net- 
work can easily be extended to higher-dimensional systems 
(seeFig.[TJ(c))ffl. 

The major problem of using the TTN (or MPS/DMRG 01) 
in two- or higher-dimensions is that they do not respect the 
area-law for entanglement entropy with fixed x Ull ■ Gen- 
erally speaking, gapped phases of local Hamiltonians are ex- 
pected to have 'local' correlations. Thus, the amount of entan- 
glement between a large, contiguous block and the rest of the 
system should scale proportional to the boundary area separat- 
ing the regions. On the other hand, wave-functions generated 
by MPS or TTN contain arbitrary large blocks with bounded 
entanglement (depending on %), and therefore have poor over- 
lap with the true ground state. 

The area law has motivated several tensor-network ansatze 
to describe higher dimensional systems, in an attempt to repli- 
cate the success of DMRG. One example is the projected 
entangled-pair state (PEPS) 0-[l6|], which can be thought of 
as a higher-dimensional generalization of MPS. The multi- 
scale entanglement renormalization ansatz (MERA) lfl7l - [l9ll 
adds additional local entanglement to the TTN, and in its sim- 
plest form (c.f. l20ll ) exactly replicates the area law in two- 
or higher-dimensional systems. The drawback of these ap- 
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proaches has been the large numerical cost, typically scaling 
as x 10 or greater. Fortunately, one expects that as compu- 
tational power increases, the accuracy of these ansatze will 
increase superpolynomially with \ (for gapped phases). 

In the mean-time, there is immediate demand for tech- 
niques with lower computational cost. Some approaches that 
have been tried recently include entangled plaquette states, 
and performing variational Monte Carlo over tensor network 
states I2II - I28I1 . Tensor network techniques that do not obey 
area laws have been used extensively in recent studies of 2D 
quantum systems, including DMRG in a cylindrical geom- 
etry iH I23, [l^l, TTN iU, and direct approximate contrac- 
tions of the 3D Suzuki-Trotter decomposition using tensor 
renormalization-group (TRG) and its variants I131h35I1 . In 
these approaches, a description of a locally-correlated state 
would require a bond-dimension that grows with system size 
(e.g. exponentially with cylinder width in 2D DMRG). 

There are two possible approaches to take in these cases: 
(a) use a small, finite geometry while keeping track of all cor- 
relations; or (b) study a large system using an ansatz with in- 
sufficient entanglement or bond-dimension \- The first ap- 
proach is typically used because finite-size effects are well- 
understood, while finite-x effects are less clear in the severely 
undersaturated regime. The purpose of this paper is to ana- 
lyze, in generality, the scaling of accuracy in approach (b). 
One reason for this to be important is that approach (b) is 
implicitly used in the 3D classical tensor renormalization- 
group (or (2+ ID) quantum TRG using a 3D representation 
of imaginary-time evolution) — a promising technique that 
recently demonstrated accuracy competitive with large-scale 
Monte Carlo studies JH]. 

We see that following a naive approach, the global accu- 
racy of (b) scales only logarithmically with x> ar, d thus a ls° 
logarithmic in the numerical cost. On the other hand, pro- 
vided proper care is taken, the accuracy of local observables 
scales polynomially with x- This exponential improvement 
puts the approach (at least formally) on similar grounds to 
finite-size scaling of small systems using DMRG. The 'trick' 
here is to realize that the anisotropic structure of the renor- 
malization means that not all sites are equal. Some sites are 
far away from the boundaries of the renormalization, and are 
able to share ample entanglement with their surrounding envi- 
ronment. In this paper, ideally located sites are called 'center' 
sites. 

The paper is structured as follows. In Sec. [TTJ the 2D 
tree-tensor network is analyzed in-detail, with arguments for 
the above scaling backed up by numerical results for the 
transverse-field Ising model on the square lattice. A simi- 
lar analysis for the tensor renormalization-group in higher- 
dimensions is shown to hold in Sec. [HI] with numerical evi- 
dence from the (classical) 3D Ising model (which is related to 
the (2+l)D quantum model). The paper concludes in Sec.HVI 
with an outlook on some possible future directions. 
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FIG. 2: (Color online) Blocks of different layers in the 2D TTN. The 
marked 'center' site at coordinates (6,6) is furthest from the block 
boundaries. This should be the site where the Hilbert space of its 
immediate environment is largest, and closest to the bulk, for all %■ 

H. TREE TENSOR NETWORK 

In this section we analyze the tree tensor network on an 
L x L square lattice. Here we use the 2-to-l renormaliza- 
tion scheme depicted in Fig.[TJ(c), alternating course graining 
along the x and y dimensions. The cost of contracting the 
tensor network corresponding to the expectation value of a 
nearest-neighbor Hamiltonian, (\E'|.H"|\I'), scales as 0(\ 4 L), 
and similarly for calculating the derivative of the energy with 
respect to the tensor variables. 

The wave function \ty) generated by the TTN contains 
blocks of size I x I (where I = 2™, for some integer n) that 
contain bounded entanglement (Schmidt rank \) with the rest 
of the system. These blocks are highlighted in Fig. [2] In two- 
dimensions, this fails to saturate the area law, which demands 
that the entanglement entropy should scale linearly with the 
perimeter of the block, requiring a Schmidt rank scaling as 
exp(aZ), for some constant a. 

We now proceed to analyze the structure of the wave- 
functions having minimal energy. For large enough system 
size L, the bond dimension will be insufficient to describe the 
entanglement of the entire system, i.e. \ < exp(aL), and 
thus the minimum energy wave-function must be distinct from 
the true ground state. On the other hand, \ will be sufficient 
to describe the entanglement within the smaller blocks. Errors 
will accumulate primarily because of the lack of entanglement 
between larger blocks. 

Somewhere between these two extremes there will be a crit- 
ical block-size I*, where smaller blocks have sufficient en- 
tanglement and are thus well-renormalized, whereas larger 
blocks do not possess large enough \- This is the point where 

X~exp(aP). (1) 

Because the blocks grow exponentially in size as a function 
of layer n, and the required bond-dimension therefore grows 
doubly-exponentially in n, we expect the transition between 
sufficient entanglement and woefully inadequate x to be quite 
sharp. 

To simplify the analysis, we compare this situation to a 
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cluster-mean field theory. In this theory, the full Hilbert space 
of clusters of size I* x I* is included, while no entanglement 
exists between neighboring clusters. In practice, a single clus- 
ter is exactly-diagonalized in a self-consistent fashion with 
their boundary conditions, and the fixed-point corresponds to 
the lowest energy state in the cluster mean-field ansatz. 

The error of global quantities such as the total energy can 
be reasonably large when using this approach. Let's assume 
that the system has correlation length £, and that the expecta- 
tion value of a local quantity decays exponentially towards the 
(correct) bulk value away from the cluster boundaries. In the 
limit that I* > £, some fraction of the system will be 'close' 
to the boundaries (closer than £) and display incorrect results, 
while the remaining 'bulk' fraction will display roughly the 
correct results. In d-dimensions, the fraction of 'error' sites 
scales as 2d£/Z*, and thus the error of a global quantity in the 
2D TTN scales as 

global error oc — ~ (2) 

I* lux 

where Eq. ([TJ was used in the second relation. The error of a 
global quantity, such as the total energy or magnetization, thus 
only decreases logarithmically with \ and thus computation 
effort. 

Although this scaling is quite poor, the ansatz takes some 
advantage from the localized entanglement in the system and 
is already a large improvement over exact diagonalizationfl. 
On top of this, we can estimate local quantities with greatly 
reduced error by understanding the structure of the ansatz. 
Points inside the well-renormalized, 'bulk' region are sur- 
rounded by an immediate environment that is a good approxi- 
mation of the true ground state (provided, again, that I* > Q. 
As mentioned earlier, in a gapped phase the effect of the 
boundary should decay exponentially, so in the centre of the 
region the error should scale as exp(— I* /2£). This time, in- 
cluding Eq. ([TJ gives 

local error ~ exp (^T^J ~ ex P ( ~ X ^ ^ 

for some (3 that depends on the specifics of the system (no- 
tably, becoming smaller for more entangled systems or those 
with larger correlation length). Thus the scaling of error with 
computation effort is polynomial — an exponential improve- 
ment on the error of global quantities. If (3 is very large, the 
method could become competitive with well-established tech- 
niques such as quantum Monte Carlo (where statistical error 
scales as the square root of computational effort, assuming no 
sign-problem). The performance will degrade significantly in 
systems with large amounts of entanglement, or close to criti- 
cal points. 

To investigate the above numerically, we identify points in 
the system that are somehow 'central' to the renormalization, 
independent of system parameters or \- In the TTN, these are 
the points that are renormalized best with their environment, 
favoring no particular direction. In the 2D TTN, we define 
these to be the points that have been successively combined 
with the block above, to the left, below, to the right, ad infini- 
tum (see Fig. |2). For any choice of x, such a point has the 



largest immediate environment whose sites are described by a 
sufficiently large Hilbert space, roughly equal in all directions. 

We have implemented the 2D TTN to study the spin- 1/2 
transverse-field Ising model on the square lattice, described 
by Hamiltonian 

h = -^2 did* + hdf, (4) 

<i,j> 

where <i,j> denotes neighboring sites and h is the strength 
of a transverse magnetic field. Because the cost of the algo- 
rithm cost scales as 0{\ L), a moderate size L = 32 is used 
in order to investigate a system that is much larger than can 
be described exactly, while small enough to allow a range of 
X to be used. To minimize the energy, I have employed the 
time-dependent variational principle 0611 and used a unitary 
gauge, finding a significant speed-up compared to the tradi- 
tional, SVD approach 0]. 

In Fig.[3]the error of the magnetization in the x direction (as 
compared to the best, center site estimate) is displayed. We 
can clearly observe a pattern of successive 'windows', where 
errors are displayed primarily on the edges of blocks. For 
larger \, the size of the windows grow while the overall error 
decreases — while the central sites always remain inside the 
smallest window. At the critical point, close to h = 3.05, there 
is a large correlation length and the windows are somewhat 
blurred, and the errors are greater in magnitude for a given x- 

We compare the predicted value of (a x ) from the global, 
average value and the local value at the center site in Fig. |4] 
Away from criticality, we observe that the center site value 
conver ges to the quantum Monte Carlo prediction (using 
ALPS l37l l38tl ) significantly faster than the global average. 
At criticality, both values appear to be converging with a 
slow, 1/ In x scaling (extrapolation may be viable when L ap- 
proaches the thermodynamic limit). At h = 3.25 the behavior 
is not monotonic because the wave-function switches from the 
ferromagnetic to disordered phase at intermediate x- 

These results confirm the above analysis and show that it is 
viable to extract meaningful information in systems where the 
parameter x is severely undersaturated. 

HI. TENSOR RENORMALISATION GROUP 

A related tensor-based, real-space renormalization tech- 
nique is the tensor renormalization group (TRG). In this 
family of methods, the tensor network corresponding to the 
Markov network of a classical thermal state (whose contrac- 
tion gives the partition function) is successively contracted 
into larger and larger blocks. At each layer, the dimension 
of the tensors must be truncated to prevent the difficulty from 
growing exponentially. The process is depicted in Fig. [5] 

The method is well-suited to contracting 2D tensor net- 
works, such as thermal state of a 2D classical system, or the 
network that results from the Trotter decomposition of a ID 
quantum density matrix. For non-critical systems, the fixed- 
point of this RG flow is well-understood |32j] in terms of 'cor- 
ner' correlations. The TRG, and related 'second renormaliza- 
tion group' (SRG) methods are able to encapsulate the cor- 
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FIG. 3: (Color online) Spatial variations in the magnetization in field 
direction at various values of h, as predicted by the lowest-energy 
TTN with the displayed bond-dimensions \- The color values repre- 
sent differences to the prediction at \ = 112. In the gapped regions 
away from the critical point (near h = 3.05), we observe a 'win- 
dowing' pattern, with window size growing with x> and rapid con- 
vergence at the central points of the renormalization (at x, y = 21). 
When the correlation length is longer, the effect is blurred somewhat 
and convergence was not reached with a value of x = 112. 
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FIG. 4: (Color online) Predictions for the magnetization in the field 
direction with various magnetic field strengths h, as predicted by the 
lowest-energy TTN with given bond-dimension %■ The red crosses 
correspond to system-wide averages, while the blue points corre- 
spond to the central sites of the renormalization. The latter con- 
verge much faster than the former, away from the critical value of 
h « 3.05. The dashed lines correspond to QMC results for the sys- 
tem at temperature T = 0.005. In (a) the two lines represent the 
statistical uncertainty, while in (b-d) the error is comparable to the 
line width. 





FIG. 5: The contraction of the 2D Markov network (i.e. partition 
function) on the left is approximated by the TRG scheme on the right, 
from 13511 . In the 'second renormalisation group', the projectors (tri- 
angular tensors) are optimized to maximize the partition function. 
The above diagram can also be considered an ansatz for the prob- 
ability distribution itself, by opening additional legs on the Markov 
network tensors (circles) corresponding to the state at that site. 



relations of the 2D classical system quasi-exactly with fixed 
X- 

This technique has been successfully applied in a 3D classi- 
cal (or 2D quantum) setting, achieving impressive accuracies 
(see e.g. Ref. Ir35tl ). However, in these higher-dimensional 
settings, the edges of the 3D blocks grow in length as the 
renormalization proceeds, and strictly speaking \ would need 
to grow exponentially to encapsulate all the correlations in a 
(non-critical) system. In general, for this kind of blocking 
scheme in d dimensions, the correlations that need to be to 
be accounted for at each level of the renormalisation grows as 
L d ~ 2 . Thus, the TRG fails to account for what one might call 
a 'corner' law for correlations [|40ll4l1l . related to the quantum 
area law for entanglement entropy that manifests naturally in 
a (d + 1) -dimensional quantum theory, where one dimension 
is time. 

The structure of the TRG is strikingly similar to the TTN 
and one might suppose that a similar analysis in the limit of 
large 3D systems with insufficient \ might hold. Numerical 
evidence in the 3D Ising model appears to support this claim. 
I implemented the 3D higher-order SRG (HOSRG) approach 
from Ref. JUt, using an SVD-update flB to optimize the 
tensors to maximize the (global) partition function. The com- 
putational effort scales as 0(x n m -k)> an d we are able to 
study much larger systems, with L = 2 12 . In Fig. [6] (a) we 
see a clear indication of the 'corner' errors in the local en- 
ergy, in a similar fashion to the 'window' pattern in Fig. [3] In 
Fig-IU(b) the average energy and center site energy are com- 
pared for different values of x< and we see even close to the 
critical point (at T w 4.1) the local quantity converges to the 
Monte Carlo prediction much more rapidly. 



IV. DISCUSSION 

We have analyzed real-space renormalization procedures 
that are unable to account for local correlations in higher- 
dimensional quantum and classical systems. The anisotropy 
of tree-structured ansatze, such as TTN and TRG, can be taken 
advantage-of to identify sites that are central to the renormal- 
ization, providing an (exponentially more) accurate descrip- 
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FIG. 6: (Color online) Predictions for the 3D Ising model at T = 4 
(less than the critical temperature of T ~ 4.1) using the HOSRG ap- 
proach, (a) The bond-energy of the z-bonds through a cross-section 
of the x-y plane, where the value at the center site labeled x, y = 21 
has been subtracted. The 3D renormalization scheme displays er- 
rors predominantly on the corners of the renormalized regions, (b) 
The energy (per-site) for different values of x< compared to classi- 
cal Monte Carlo results. The red crosses correspond to system-wide 
averages, while the blue points correspond to the central sites of the 
renormalization. Similar to the quantum TTN, the latter converge 
much faster than the former. The dashed line corresponds to Monte 
Carlo simulations of a 48x48x48 lattice. 

tion of local quantities. The scaling of accuracy to numerical 
cost is expected to be polynomial (for gapped systems), for- 
mally putting the technique on a similar footing to 2D DMRG 
of small systems and quantum Monte Carlo (though in prac- 
tice the algorithms used here might not be as efficient). 

There has been some reluctance to use a severely under- 
correlated ansatz for a quantum wave-function, where recent 
focus has been on DMRG in small geometries such as narrow 
cylinders. On the other hand, the TRG approach has shown 
very promising results in 3D classical (and 2D quantum) sys- 



tems, while there has been less discussion on the inherent in- 
ability to account for all correlations in a large system. In-fact, 
it is possible that the central-site technique has been imple- 
mented in these studies in the past. 

From here, two possible directions to increase the effec- 
tiveness of real-space RG techniques in higher dimensions 
become apparent. The first would be find more efficient al- 
gorithms in the under-correlated regime. For example, the 
HOSRG was a step in this direction B35I1 . compared to earlier 
3D TRG algorithms. In the present work, it could be benefi- 
cial to replace the TTN with a MPS having a tree-like struc- 
ture, investigated already in Ref. ll42ll . The cost would reduce 
to 0(x 3 L 2 ), but one would still be limited to moderate system 
sizes. 

The second approach would be to use an ansatz that takes 
into account the correlation structure of the system. PEPS and 
MERA already exist to describe higher-dimensional quantum 
systems. In the realm of classical Markov networks, progress 
was made in Ref. l32tl to use a TRG-like approach to take 
account of all local correlations (for 2D classical systems). 
However, a more direct, MERA-like approach to 2D and 3D 
classical systems would represent a major advancement in this 
field. 
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